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multiple system buses and an I/O bus to a shared main memory and efficiently 
maintains cache coherency while minimizing the impact to latency and total 
bandwidth within the system. The system provides coherency filters which coordinate 
bus-to-bus communications in such a way as to maintain cache memory coherency with 
a small amount of cross-bus traffic. In addition, the system provides a multiported 
pool of memory cells which interconnect the multiple buses. 

9 Claims, 11 Drawing figures 
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PRIMARY-EXAMINER: Swann; Tod R. 
ASSISTANT-EXAMINER: King, Jr.; Conley B. 
ATTY-AGENT-FIRM: Staas & Halsey 

ABSTRACT: 

A plurality of processors, each with caches provided for a plurality of processor 
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modules and a local storage in which a main storage is distributed and arranged are 
mutually connected through an internal snoop bus. The processor modules are 
mutually connected through a second system bus. By using two separate buses, cache 
coherence operations within a processor group is kept separate from cache coherence 
operations outside the processor group. 

60 Claims, 65 Drawing figures 
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LI: Entry 2 of 37 



File: USPT 



Jun 8, 2004 



DOCUMENT- IDENTIFIER: US 6748465 B2 
TITLE: Local bus polling support buffer 

Detailed Description Text (23) : 

Maintaining a copy of data concerning I/O devices coupled to bus 250 in buffer 242 
allows the number accesses that I/O controller 140 makes to memory 232 to be 
reduced. This reduction in accesses may, in turn, allow memory 232 to be powered 
down at times where memory 232 would otherwise have to be powered up to allow I/O 
controller 240 to make such accesses. In an embodiment of computer system 200 that 
is further comprised of cache 234, this reduction in accesses may also allow cache 
234 to remain powered down along with memory 232 at times where cache 234 would 
otherwise have to be powered up to either respond to accesses being made by I/O 
controller 140, or to take steps necessary to maintain coherency between data 
stored in cache 234 and memory 232. In another embodiment of computer system 200 
that is further comprised of cache 214, this reduction in accesses may also allow 
cache 214 to remain powered down, which may in turn, allow processor 210 to remain 
powered down at times where it would otherwise be necessary to be powered up in 
order to maintain coherency between data stored in cache 214 and memory 232. In 
still another embodiment of computer system 200, both caches 214 and 234 are 
present, either processor 210 or memory 232 and their associated caches may be 
powered down during the normal operation of computer system 200, as determined to 
be appropriate as part of whatever measures are being taken to reduce power 
consumption by computer system 200. In such an embodiment, the reduction in 
accesses to memory 232 would allow whichever ones of processor 210 or memory 232 
and their associated caches to remain powered down. 
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ATTY-AGENT-FIRM: Blakley, Sokoloff, Taylor & Zafman LLP 
ABSTRACT: 

A method and apparatus for allowing memory, cache and/or a processor to remain 
powered down while repetitive transactions are carried out on an I/O bus and 
actions are taken in response to feedback received from I/O devices coupled to the 
I/O bus. 

28 Claims, 3 Drawing figures 
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Ll: Entry 15 of 37 



File: USPT 



Oct 16, 2001 



DOCUMENT- IDENTIFIER: US 6304932 Bl 

TITLE: Queue-based predictive flow control mechanism with indirect determination of 
queue fullness 

Detailed Description Text (30) : 

Finally, each module that has a cache memory, including both processor and 
input/output modules, has a cache coherency queue for storing coherent transactions 
in a first-in first-out ("FIFO") order. A coherent transaction is any transaction 
(such as a read) that results in the need to check other caches to see whether the 
requested data is in the other cache, or to verify that the cache is up-to-date. 
Such transactions are indicated by signals sent during the address cycle for the 
transactions initiated on bus 12. Each module having a cache memory monitors the 
bus and loads coherent transaction into its cache coherency queue, referred to 
herein as CCC queues. The coherent transactions wait in the CCC queue of a 
particular module until that module checks its cache, and reports the results of 
that coherency check to main memory controller 14. In a preferred implementation, 
main memory controller 14 begins reading the main memory as soon as the read 
transaction has been issued. Main memory controller 14 waits until the results of 
the coherency checks are reported by all of the modules, and then responds to the 
coherent transaction. If no client module has a private-dirty copy of the data, 
main memory controller 14 will supply the data from main memory. Otherwise, the 
client module that has a private-dirty copy will supply the data and main memory 
controller 14 will update main memory with the new data value. In a preferred 
implementation, coherency responses are received by main memory controller 14 
quickly enough so that there is no appreciable delay in responding to the 
transaction . 
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CROSS REFERENCE TO RELATED APPLICATION This is a continuation of application Ser. 
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D 5265235 



November 1993 



Sindhu et al. 



711/120 



ART-UNIT: 273 

PRIMARY-EXAMINER: Ellis; Richard L . 
ABSTRACT : 

A shared bus system having a bus and a set of client modules coupled to the bus. 
Each client module is capable of sending transactions on the bus to other client 
modules and receiving transactions on the bus from other client modules for 
processing. Each module has a queue for storing transactions received by the module 
for processing. A bus controller limits the types of transactions that can be sent 
on the bus to prevent any module's queue from overflowing. 

3 Claims, 2 Drawing figures 
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DOCUMENT -IDENTIFIER : US 6182176 Bl 

** See image for Certificate of Correction ** 

TITLE: Queue-based predictive flow control mechanism 

Detailed Description Text (30) : 

Finally, each module that has a cache memory, including both processor and 
input/output modules, has a cache coherency queue for storing coherent transactions 
in a first-in first-out ("FIFO") order. A coherent transaction is any transaction 
(such as a read) that results in the need to check other caches to see whether the 
requested data is in the other cache, or to verify that the cache is up-to-date. 
Such transactions are indicated by signals sent during the address cycle for the 
transactions initiated on bus 12. Each module having a cache memory monitors the 
bus and loads coherent transaction into its cache coherency queue, referred to 
herein as CCC queues. The coherent transactions wait in the CCC queue of a 
particular module until that module checks its cache, and reports the results of 
that coherency check to main memory controller 14. In a preferred implementation, 
main memory controller 14 begins reading the main memory as soon as the read 
transaction has been issued. Main memory controller 14 waits until the results of 
the coherency checks are reported by all of the modules, and then responds to the 
coherent transaction. If no client module has a private-dirty copy of the data, 
main memory controller 14 will supply the data from main memory. Otherwise, the 
client module that has a private-dirty copy will supply the data and main memory 
controller 14 will update main memory with the new data value. In a preferred 
implementation, coherency responses are received by main memory controller 14 
quickly enough so that there is no appreciable delay in responding to the 
transaction. 
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OTHER PUBLICATIONS 



U.S. application No. 0497054A3, Sindhu et al., filed Aug. 5, 1992. 
U.S. application No. 0317468A3, Hammer et al., filed May 24, 1989. 

ART-UNIT: 273 

PRIMARY-EXAMINER: Ellis; Richard L. 
ABSTRACT: 

A shared bus system having a bus and a set of client modules coupled to the bus. 
Each client module is capable of sending transactions on the bus to other client 
modules and receiving transactions on the bus from other client modules for 
processing. Each module has a queue for storing transactions received by the module 
for processing. A bus controller limits the types of transactions that can be sent 
on the bus to prevent any module's queue from overflowing. 

1 Claims, 2 Drawing figures 
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Ll: Entry 35 of 37 



File: USPT 



May 12, 1992 



DOCUMENT-IDENTIFIER: US 5113514 A 

TITLE: System bus for multiprocessor computer system 



Abstract Text (1) : 

The invention comprises a system bus apparatus and method for a multi-arm, 
multiprocessor computer system having a main memory and localized buffer cache 
memories at each processor. Each block of data in a cache includes tag bits which 
identifies the condition of the data block in relation to the corresponding data in 
main memory and other caches. The system bus (SYSBUS) comprises three subparts; 1) 
a MESSAGE/DATA bus, 2) a REQUEST /GRANT bus and 3) a BCU bus. The MESSAGE/DATA bus 
is coupled to every device on the system and is used for transferring messages, 
data and addresses. The REQUEST/GRANT bus couples between every device on an arm of 
the system and that arm's bus control unit (BCU) . The BCU bus couples between the 
various BCUs . Both the MESSAGE/DATA bus and the BCU bus include ACK/NACK/HIT bits 
which are used when responding to messages received over the SYSBUS to inform the 
message-issuing device if the devices received the message and, if so, the 
condition of the data in relation to other caches and main memory. The protocol 
allows inconsistent copies of data to exist and prevents stale data from being used 
erroneously by monitoring the tag bits and the ACK/NACK/HIT bits. Further, under 
the appropriate conditions, a copy of the most recent data block may be transferred 
from one cache to another (with appropriate updating of tags) without updating the 
main memory. When a memory operation will bring about a situation where cache 
coherence can no longer be maintained, main memory is updated with the most recent 
copy of the data and the other caches are either updated or tagged as invalid. 
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OTHER PUBLICATIONS 

J. Archibald & J. Baer, Cache Coherence Protocols: Evaluation Using A 
Multiprocessor Simulation Model, ACM Transactions on Computer Systems, vol. 4, No. 
4 (Nov. 1986). 

ART-UNIT: 232 

PRIMARY-EXAMINER : Bowler; Alyssa H. 
ATTY-AGENT-FIRM: Wolf, Greenfield & Sacks 

ABSTRACT: 

The invention comprises a system bus apparatus and method for a multi-arm, 
multiprocessor computer system having a main memory and localized buffer cache 
memories at each processor. Each block of data in a cache includes tag bits which 
identifies the condition of the data block in relation to the corresponding data in 
main memory and other caches. The system bus (SYSBUS) comprises three subparts; 1) 
a MESSAGE/DATA bus, 2) a REQUEST /GRANT bus and 3) a BCU bus. The MESSAGE/DATA bus 
is coupled to every device on the system and is used for transferring messages, 
data and addresses. The REQUEST/GRANT bus couples between every device on an arm of 
the system and that arm's bus control unit (BCU). The BCU bus couples between the 
various BCUs . Both the MESSAGE /DATA bus and the BCU bus include ACK/NACK/HIT bits 
which are used when responding to messages received over the SYSBUS to inform the 
message-issuing device if the devices received the message and, if so, the 
condition of the data in relation to other caches and main memory. The protocol 
allows inconsistent copies of data to exist and prevents stale data from being used 
erroneously by monitoring the tag bits and the ACK/NACK/HIT bits. Further, under 
the appropriate conditions, a copy of the most recent data block may be transferred 
from one cache to another (with appropriate updating of tags) without updating the 
main memory. When a memory operation will bring about a situation where cache 
coherence can no longer be maintained, main memory is updated with the most recent 
copy of the data and the other caches are either updated or tagged as invalid. 

28 Claims, 19 Drawing figures 
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Memory controller for controlling memory accesses across 
networks in distributed shared memory processing systems 



KWIC 



Brief Summary Text - BSTX (27): 

A cache coherency system for a shared memory parallel processing system 
including plurality of processing nodes. A multi-stage communication network 
is provided for interconnecting the processing nodes. Each processing node 
includes one or more caches for storing a plurality of cache lines. A cache 
coherency directory is distributed to each of the nodes for tracking which of 
the nodes have copies of each cache line. A plurality of send FIFOs and 
receive FIFOs are used at each node adapter to segregate and handle invalidate 
functions, remote stores, and remote reads requiring cache coherency. 
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[57] ABSTRACT 

A nhflT ftri m ggnra y pAr flTI^I pw><y <p»fi g tgyctfrm fatCtDQ gae cEed 

by ft multi-stage network combines new system configure 
tic© technique* with special-purpose httdwttt to proviso 
remote mcasxy accesses across Ibc network, while coalrol- 
Uig ache cefcewocy efflc&nSy across tiu network. The 
system configuration techniques include « systematic 
me!bod foi paitilajoing and ccntroHinsg the mwuoiy is 
relation to local verses i emote accesses and changeable 
Tcraca unchangeable data. Mbsi of the special .purpose bird, 
ware b Implemented in the memory controller tod network 
adapter, which implement three send FIFOs and ihrec 
receive FIFOs at each node to segregate and handle effi- 
ciently invalidate fanmcna, remote Blares, and remote 
accesses requiting cscae coherency. The segregation of 
these three functions Into different send and receive FIFOs 
greatly facilitates the cache coherency function over the 
network, la addition, die network itself is tailored to provide 
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